Posting Compression in Dynamic Retrieval Environments
Author
Abstract
This paper describes a posting compression technique for dynamic full-text document retrieval environments. The technique applies to main-memory document retrieval systems and consists of two parts: the efficient use of auxiliary tables, and the application of Zipf's well-known rank-frequency law. It is shown that term weights can be approximated on the basis of this law, so that their explicit storage can be avoided. The underlying retrieval model is the vector space retrieval model, which is known to deliver high retrieval quality. Before presenting the compression results, the paper also describes some minor modifications to the vector space retrieval model that slightly improve its quality.
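As a rough illustration of the idea that a rank-frequency law can stand in for stored values, the following minimal sketch estimates a term's frequency from its rank alone, assuming the simple form f(r) = f(1) / r^s. The function names, the `exponent` parameter, and the sample vocabulary are hypothetical and are not taken from the paper; the paper's actual weight approximation may differ.

```python
from typing import Dict, Sequence


def zipf_frequency(rank: int, top_frequency: float, exponent: float = 1.0) -> float:
    """Estimate the frequency of the term at `rank` (1-based).

    Assumes the rank-frequency relation f(r) = f(1) / r**exponent,
    which is an illustrative simplification, not the paper's formula.
    """
    if rank < 1:
        raise ValueError("rank must be at least 1")
    return top_frequency / rank ** exponent


def approximate_frequencies(
    terms_by_rank: Sequence[str], top_frequency: float, exponent: float = 1.0
) -> Dict[str, float]:
    """Map each term to its estimated frequency, given terms sorted by rank."""
    return {
        term: zipf_frequency(rank, top_frequency, exponent)
        for rank, term in enumerate(terms_by_rank, start=1)
    }


if __name__ == "__main__":
    # Hypothetical vocabulary, ordered from most to least frequent.
    vocabulary = ["the", "of", "retrieval", "posting"]
    print(approximate_frequencies(vocabulary, top_frequency=1000.0))
```

Under this assumption only the rank order and one reference frequency need to be kept, rather than a value per term, which is the kind of saving the abstract alludes to.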
Similar resources
Chaotic Genetic Algorithm based on Explicit Memory with a new Strategy for Updating and Retrieval of Memory in Dynamic Environments
Many of the problems considered in optimization and learning assume that solutions exist in a dynamic environment. Hence, algorithms are required that adapt dynamically to the problem's conditions and search the new conditions. In most cases, using information from the past allows quick adaptation after a change. This is the idea underlying the use of memory in this field, which involves key design issue...
On Inverted Index Compression for Search Engine Efficiency
Efficient access to the inverted index data structure is a key aspect for a search engine to achieve fast response times to users’ queries. While the performance of an information retrieval (IR) system can be enhanced through the compression of its posting lists, there is little recent work in the literature that thoroughly compares and analyses the performance of modern integer compression sch...
Ongoing standardization efforts
The JPEG committee has been continuing its activities on the definition of international standards related to still image coding systems in order to prepare the ground for interoperable solutions, products and services that make use of digital images. Currently, the JPEG committee is pursuing three such efforts, namely, JPSearch, which deals with annotation, search and retrieval of digital images and...
Object-Relational Database Representations for Text Indexing
One of the distinctive features of Information Retrieval systems compared to Database Management systems is that they offer better compression for posting lists, resulting in better I/O performance and thus faster query evaluation. In this paper, we introduce database representations of the index that reduce the size (and thus the disk I/Os) of the posting lists. This is not achieved by redes...
Dynamic recrystallization kinetics of AISI 403 stainless steel using hot compression test
In this work, the dynamic recrystallization behavior of AISI 403 martensitic stainless steel was studied using hot compression tests over a temperature range of 900 °C to 1200 °C and a strain rate range of 0.001 s⁻¹ to 1 s⁻¹. The obtained flow curves showed that the hot compression behavior of the alloy is controlled by dynamic recrystallization. The flow stress and strain corresponding to the critical, pe...
Journal title:
Volume / Issue
Pages -
Publication date: 1991